CDS

Accession Number TCMCG074C20597
gbkey CDS
Protein Id KAF8402204.1
Location complement(join(25813253..25813511,25813777..25813854,25813947..25814056,25814245..25814673,25814766..25814788,25815164..25815234,25815455..25815501))
Organism Tetracentron sinense
locus_tag HHK36_013156

Protein

Length 338aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000008.1
Definition hypothetical protein HHK36_013156 [Tetracentron sinense]
Locus_tag HHK36_013156

EGGNOG-MAPPER Annotation

COG_category U
Description AP-4 complex subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K12400        [VIEW IN KEGG]
EC -
KEGG_Pathway ko04142        [VIEW IN KEGG]
map04142        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005829        [VIEW IN EMBL-EBI]
GO:0005911        [VIEW IN EMBL-EBI]
GO:0009506        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0030054        [VIEW IN EMBL-EBI]
GO:0030117        [VIEW IN EMBL-EBI]
GO:0030119        [VIEW IN EMBL-EBI]
GO:0030124        [VIEW IN EMBL-EBI]
GO:0032991        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044425        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0048475        [VIEW IN EMBL-EBI]
GO:0055044        [VIEW IN EMBL-EBI]
GO:0098796        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTATTGTCATCACAACACGGATTACAATCTCAGCTATCAAAAAGGCTGTGCAATGAATGACTACATAGAAGCAGTTAGAGAAGTAGCATGTGAGATTCTGGAACTAATGGCAGAAGCGTTCATTCTTTCCAGGCCTATGGTAGATAAAAGTTTCTCGTTCCTGAACAATTATGTTCAGCAGTCCTTAGAAAAAGGAGATCGGCCATACATCCCTGAGAATGAGCGGTCTGGAATGTTAAATATCAACAATTTTAGAAGCCAATACCAACATGAGGCTTCTACACATGCTCTCAGGTTCGAAGCATACGAGCTTCCACAGCCCTCAGTGGCATCAAGGATTCCTTCAGTTCCAGTTCCACTTGCATCCTCAACAGAACTCATGCCAGTATCTGAACCCATTTATCCTAAAGAAATCCACCAAGTTGCATCATTGCCATCTGTTTCAGATACAAGATTAGTAGAACTCAAGCTTCAGTTAGAAGGGGTTCAAAAGAAGTGGGGTAGGCCAACTTACTCCTCTCCTGCACCATCTACCTCAAGTTCTACTACCCAGAAAACAGTGAATGAGATCTTCAAGATTGGGATGCCCAAGGCCAGTGAGTTCTGGAAGATGCCCCGACAGGCGATTTATTTGGAAAACAATGACTTTCACCGAGGAAATGTTGAAGATTGCTTGAGACATGGTGGATTCTGGGTTTCTGGGAAACAGATCTGCACTCTGCTTGGAATGATTGGGGGCTGTAGAGAGGTGAATAAGGAAAGGAAGGAAGGATCTTGGAGAAACCGATGTTTTGTGGGTATGGCAGCATGTATGATTATAGGGTTAGAGACGGGGCTTTTGAGCGGCGAATGTCATGCCTTCACCGAGGACTCACAGATTATAACTGTGGCTGAAGGGAATAAGCTTGTGAGATGGAGCGATAAAAGAATGTGCCCCCCTTGGCAAATGAATTCATTGGAGATCATTGTCCCTGAAAACCTTCCCAGACCTTCCGCTCACCGGAAGTTGGTTTAG
Protein:  
MYCHHNTDYNLSYQKGCAMNDYIEAVREVACEILELMAEAFILSRPMVDKSFSFLNNYVQQSLEKGDRPYIPENERSGMLNINNFRSQYQHEASTHALRFEAYELPQPSVASRIPSVPVPLASSTELMPVSEPIYPKEIHQVASLPSVSDTRLVELKLQLEGVQKKWGRPTYSSPAPSTSSSTTQKTVNEIFKIGMPKASEFWKMPRQAIYLENNDFHRGNVEDCLRHGGFWVSGKQICTLLGMIGGCREVNKERKEGSWRNRCFVGMAACMIIGLETGLLSGECHAFTEDSQIITVAEGNKLVRWSDKRMCPPWQMNSLEIIVPENLPRPSAHRKLV